20 research outputs found

    Zero-shot sim-to-real transfer of tactile control policies for aggressive swing-up manipulation

    Full text link
    This paper aims to show that robots equipped with a vision-based tactile sensor can perform dynamic manipulation tasks without prior knowledge of all the physical attributes of the objects to be manipulated. For this purpose, a robotic system is presented that is able to swing up poles of different masses, radii and lengths, to an angle of 180 degrees, while relying solely on the feedback provided by the tactile sensor. This is achieved by developing a novel simulator that accurately models the interaction of a pole with the soft sensor. A feedback policy that is conditioned on a sensory observation history, and which has no prior knowledge of the physical features of the pole, is then learned in the aforementioned simulation. When evaluated on the physical system, the policy is able to swing up a wide range of poles that differ significantly in their physical attributes without further adaptation. To the authors' knowledge, this is the first work where a feedback policy from high-dimensional tactile observations is used to control the swing-up manipulation of poles in closed-loop.Comment: Accompanying video: https://youtu.be/4rG-o2Cz3-

    Towards vision-based robotic skins: a data-driven, multi-camera tactile sensor

    Full text link
    This paper describes the design of a multi-camera optical tactile sensor that provides information about the contact force distribution applied to its soft surface. This information is contained in the motion of spherical particles spread within the surface, which deforms when subject to force. The small embedded cameras capture images of the different particle patterns that are then mapped to the three-dimensional contact force distribution through a machine learning architecture. The design proposed in this paper exhibits a larger contact surface and a thinner structure than most of the existing camera-based tactile sensors, without the use of additional reflecting components such as mirrors. A modular implementation of the learning architecture is discussed that facilitates the scalability to larger surfaces such as robotic skins.Comment: Accompanying video: https://youtu.be/lbavqAlKl9

    Chain of Hindsight Aligns Language Models with Feedback

    Full text link
    Learning from human preferences is important for language models to be helpful and useful for humans, and to align with human and social values. Prior work have achieved remarkable successes by learning from human feedback to understand and follow instructions. Nonetheless, these methods are either founded on hand-picked model generations that are favored by human annotators, rendering them ineffective in terms of data utilization and challenging to apply in general, or they depend on reward functions and reinforcement learning, which are prone to imperfect reward function and extremely challenging to optimize. In this work, we propose a novel technique, Chain of Hindsight, that is easy to optimize and can learn from any form of feedback, regardless of its polarity. Our idea is inspired by how humans learn from extensive feedback presented in the form of languages. We convert all types of feedback into sentences, which are then used to fine-tune the model, allowing us to take advantage of the language comprehension capabilities of language models. We condition the model on a sequence of model generations paired with feedback. By doing so, models are trained to generate outputs based on feedback, and models can learn to identify and correct negative attributes or errors. Applying our method to large language models, we observed that Chain of Hindsight significantly surpasses previous methods in aligning language models with human preferences. We observed significant improvements on summarization and dialogue tasks and our approach is markedly preferred in human evaluations.Comment: included new result

    MMP-2, MMP-9 and activin A blood levels in patients with breast cancer or prostate cancer metastatic to the bone.

    Get PDF
    Background: The clinical significance of the circulating levels of activin A and matrix metalloproteinase-2 (MMP-2) and -9 (MMP-9) was investigated in patients with breast cancer (BC) or prostate cancer (PC) with (M1) or without (M0) bone metastasis. Patients and Methods: MMP-2, MMP-9 and activin A blood concentrations were measured by enzyme immunoassays in 79 cancer patients and in 57 healthy blood donors (HS) who served as a control group. The diagnostic accuracy of these molecules to discriminate between M0 and M1 patients was evaluated by the receiver operating characteristic curve (ROC) and compared to that of tumor markers CA15.3 or prostate-specific antigen (PSA). Results: Activin A and MMP-2 were significantly increased in BC and PC patients as compared to sex-matched HS while MMP-9 levels were more elevated only in the PC patients. Interestingly, in the PC patients, activin A levels were significantly higher than those measured in the BC patients. In this latter group, activin A and CA15.3 but not MMP-2 or MMP-9 were increased in the M1 patients as compared to M0 patients. Furthermore, a significant relationship was also highlighted between activin A concentration and the number of bone metastases and tumor grade, between MMP-9 and tumor grade, and between MMP-2 and CA15.3. ROC curve analysis showed a good diagnostic accuracy for activin A and CA15.3 but a poor accuracy for MMP-2 and MMP-9 in discriminating between M0 and M1 patients. However, CA15.3 retained the best diagnostic accuracy in this respect. In the PC group, only activin A and PSA levels were significantly increased in the M1 patients as compared to the M0 patients. A similar although not statistically significant trend was noted for MMP-9. Interestingly, a significant correlation was observed between PSA and activin A and MMP-9, and between Activin A and Gleason score and the number of skeletal metastases. ROC curve analysis showed a good diagnostic accuracy for activin A, MMP-9 and PSA and a poor diagnostic accuracy for MMP-2 in detecting M1 patients. However, PSA showed the highest diagnostic accuracy. Conclusion: Activin A, MMP-2 and MMP-9 may be regarded as possible therapeutic targets in the treatment of metastatic bone disease. However, their usefulness as additional markers of bone metastasis remains to be better define

    A general framework for high-resolution robotic tactile sensing: design, simulation, and learning

    No full text
    In order to fulfill their potential in the manufacturing and retail sectors of the modern world, autonomous machines need to be able to perceive and react to contact with their surroundings, both to enhance their capabilities, as well as to increase operational safety. This thesis investigates solutions to the contact sensing problem of robotic systems, pivoting on the development of a vision-based tactile sensing principle that provides rich information upon physical interaction with the environment. The sensors based on such a principle are low-cost, scalable to large surfaces and straightforward to manufacture. However, they do not directly measure physical quantities, but rather provide raw data in the form of what are generally known as tactile images. In this work, a machine learning-based data processing framework is presented to address three main requirements, namely, sensing accuracy, efficiency, and generalization across tasks and contact conditions. State-of-the-art sensing accuracy, at a spatial resolution comparable to that of the human fingertip, is achieved through a deep neural network that maps the raw tactile images to the three-dimensional force distribution applied to the sensing surface, which provides a compact and generic representation of the contact state. In fact, the force distribution contains information about the location and the intensity of shear and pressure forces, as well as about the shape and the number of the possibly distinct contact regions. In addition, it provides an interpretable physical quantity that is shown to be very practical for planning higher-level robotic tasks. The size of the neural network is kept compact to ensure real-time inference. However, in the context of data-driven methods, efficiency is also a concern with regard to training data requirements. In this thesis, accurate finite element-based simulations enable the synthetic generation of raw tactile data under a variety of contact conditions. The same simulations also yield appropriate force distribution labels, which are otherwise not possible to collect with currently existing commercial force sensors. Hence, the deep neural network is entirely trained with synthetic data, avoiding the need for real-world data collection. A strategy is then presented that facilitates a seamless transfer of the inference model from simulation to reality, retaining high sensing accuracy. In addition, the model transfers across sensors of the same type without further training. The simulation training facilitates data collection across different scenarios, such as the contact with arbitrarily shaped objects or the combination of shear and pressure interactions. An appropriate choice of learning architecture shows generalization capabilities when applied to contact conditions not present in the training dataset. Beyond the pure sensing task, a proof-of-concept robotic system is presented that fully leverages the versatility of the tactile sensor. The system achieves dynamic manipulation of objects with unknown physical properties solely through the use of tactile feedback fed to a closed-loop control policy trained with a deep reinforcement learning algorithm. In a separate part, this thesis discusses a different research topic, where past experience data are employed to improve the trajectory tracking performance of autonomous systems. This is achieved by estimating unmodeled disturbances over different trials, and including them in the formulation of a computationally efficient model predictive control framework. The approach is demonstrated on two flying vehicle applications, namely, on a vehicle powered with electric ducted fans and controlled through thrust vectoring, and on a quadcopter that aims to balance a pendulum rod during flight

    Sim-to-Real for High-Resolution Optical Tactile Sensing: From Images to Three-Dimensional Contact Force Distributions

    No full text
    The images captured by vision-based tactile sensors carry information about high-resolution tactile fields, such as the distribution of the contact forces applied to their soft sensing surface. However, extracting the information encoded in the images is challenging and often addressed with learning-based approaches, which generally require a large amount of training data. This article proposes a strategy to generate tactile images in simulation for a vision-based tactile sensor based on an internal camera that tracks the motion of spherical particles within a soft material. The deformation of the material is simulated in a finite element environment under a diverse set of contact conditions, and spherical particles are projected to a simulated image. Features extracted from the images are mapped to the three-dimensional contact force distribution, with the ground truth also obtained using finite-element simulations, with an artificial neural network that is therefore entirely trained on synthetic data avoiding the need for real-world data collection. The resulting model exhibits high accuracy when evaluated on real-world tactile images, is transferable across multiple tactile sensors without further training, and is suitable for efficient real-time inference.ISSN:2169-5172ISSN:2169-518
    corecore